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1. INTRODUCTION 


The availability of large quantities of digital (numerical) mul tl spectral 
scanner data (MSS) from the Land Satellite (Landsat) system has contributed 
greatly to the rapid development of remote sensing technology. Parallel 
evolution of machine processing techniques for these data has resulted from 
desires for greater accuracy and for the ability to monitor larger and larger 
areas. One of the major beneficiaries of these developments has been 
agriculture. Remote sensing provides a unique data source for monitoring food 
and fiber production on a worldwide scale. 

At the heart of such applications systems lies the processing and presentation 
of the basic digital data. Effective, yet simple. Isolation of the 
Informational content of such data Is required. The digital data are 
frequently transformed to enhance features of particular Interest and used 
either directly in numerical algorithms or through image interpretation of 
film products. Quantitative estimates of plant characteristics such as yield, 
quality, and growth stage have frequently relied on direct use of digital data 
through statistically derived predictive equations. Crop Identification and 
acreage estimation generally have involved the use of numerical pattern 
recognition in conjunction with visual interpretation of film products. 
Typically, training data for supervised classification algorithms have been 
provided through the Identification and labeling of homogeneous areas In 
photographic products. Generally, the mul tl temporal progression of colors of 
a given field or fields is used for labeling purposes. The colors are 
determined by an analyst from a false color-infrared (IR) film product using 
Landsat bands 4, 5, and 7.* 

The purpose of this paper Is to present a proposed transformation between 
digital data and color space. The underlying assumption is that transforming 
multispectral data into color notation provides a powerful analysis 


^Henceforth, In this paper we will refer to Landsat MSS bands 4 through 7 as 
channels 1 through 4. 
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tool for development of objective techniques useful In both Imaqe 
interpretation and numerical modeling. In other words, color can serve as the 
primary mechanism for extracting Information from Landsat data. The 
previously mentioned use of multi temporal color sequences In labeling Is an 
obvious example. A stable transform between digital data and film products 
would allow for identification, verification, and utilization of subtle 
variations in color. These are normally difficult to quantify, yet they 
supply essential Information In the decision logic used by experienced image 
Interpreters. This transform would reduce the subjective nature of the 
analyst decisions that makes results difficult to replicate. Ultimately, the 
movement from research and development to general usage would become a 
transfer of technology rather than of experience and art. In addition, a 
substantial body of agricultural literature exists which relates plant 
characteristics such as stress (moisture and nutrient), quality and key 
morphological changes to color. 

Significant variations, both within scenes over time and between scenes, 
produced by atmospheric radiative processes have hindered the quantitative 
i ntemretation of Landsat digital data. Radiative transfer models have been 
used to identify and evaluate the magnitude of the sources of variation 
(Lambeck and Potter, 1979). However, application of these models for purposes 
of correcting the Landsat signal has been impractical due to the operational 
shortage of data on key parameters such as total optical depth. To date, 
attempts to solve this problem In application have met with only limited 
success. Sufficient variability has remained such that it has only been 
feasible to classify Landsat measurements within individual scenes. For 
similar reasons, algorithms to estimate plant characteristics have shown a 
tendency to be dominated by gross location-to-location variations and be 
relatively insensitive to within-scene variations. 

The transform discussed in the following sections is intended tu provide a 
stable transformation between digital data and color space which maximizes the 
spatial and temporal comparability of Landsat data. The color space is both 
logical in terms of color theory and compatible with the hardware systems used 
to produce color film products. 


2. SCENE STANDARDIZATION 


The spatial and temporal variability of agricultural scenes Is Illustrated In 
figures 1 through 4, which are plots of 18 550 random picture elements 
(pixels) sampled from 371 Landsat acquisitions from 35 LAC IE spring wheat 
ground-truth sites during the spring, summer, and fall of 1978. The pixel 
values are plotted against the means of the acquisitions from which they were 
sampled. These figures show that about half of the variability amonq pixel 
channel measurements can be accounted for by variability In the means of the 
acquisitions in which they occur. This Is a one to one relationship, with a 
slope of one and an intercept of zero. The variability in the means of the 
acquisitions can be caused by both scene-wide atmospheric effects 
(illumination, transmission, scattering, calibration, noise) and differences 
In scene composition (proportions of plants and soils of varying 
appearance). Such sources of variation are shown in table 1. These are 
difficult to estimate separately, but the combined multiplicative effect can 
be neutralized approximately If all pixel channel values are divided by the 
channel mean for the acquisition. However, some of the sources of variation 
(especially scattering and noise) have an additive effect and tend to mask the 
signal. Hence, the utility of the channel mean preprocessing may be reduced 
as atmospheric haze increases. A subsequent paper will discuss the results of 
simulation and statistical studies designed to elucidate further the extent to 
which the efficiency of the mean standardization technique is affected by the 
various components of acquisition variability. Details of the proposed 
preprocessing technique are presented in the next paragraph. 

With the assumption that the additive and multiplicative terms are the same 
for both the scene and a element within the scene, the standardization may be 
written as follows: 

m. S. (element) + a* 

nr - 1 * 1 

i m i Sj (scene) + a^ 

where the standardized data are expressed as relative energy (RE), 
represents the measured spectral signal for the 1^ Landsat band and m^ and a, 
are multiplicative and additive terms, respectively. 


TABLE 1.- SOURCES OF VARIATION IN MEANS OF LANOSAT DATA ACQUISITIONS 


Variable 

Source 

Effect type 

Multiplicative 

Additive 

Illumination 

Sun angle, atmospheric 
scattering, path trans- 
mission 

X 


Transmission 

Zenith path transmission, 
atmospheric scattering 

X 


Path radiance 

Atmospheric backscatter 


X 

Cal ibration 

Electronics degradation 

X 


Noise 

System components 


X 

Scene composition 

Differences in amounts of 
soils and plants of 
varying appearance 

X 
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Figure 3.- The relationship between scene means and dots selected randomly 
from spring wheat scenes, Landsat channel 3. 



Figure 4.- The relationship between scene means and dots selected randomly 
from spring wheat scenes, Landsat channel 4. 


Alternatively, the standardization can be expressed a*; follows: 

1. Scene channel means (CHj) are individually standardized to a constant, K, 
so that K/CH.J = channel correction factor ( CCF ^ ) 

2. Each digital value, by channel, is then multiplied by the appropriate 
correction factor to obtain a relative energy (RE) value 

REf = CCF.j * raw value 

For use in numerical algorithms, we have chosen to assign K the arbitrary 
value of 5, corresponding to the midpoint of a 0 to 10 relative energy 
scale. This practice provides continuity with other colorimetric systems, 
such as the Munsell (1963), which place brightness on a 0 to 10 scale. A 
value of K = 128 is used in the creation of false color imagery as discussed 
in the next section. 

It can be shown mathematically that the data structure (see the derivation in 
the appendix 2 ) is unchanged by this transformation. Table 2 gives empirical 
verification of this assertion, using the 18 o50 observations plotted in 
figures 1 through 4. The table shows the between-channel correlation matrix 
for both the raw digital data and the transformed values. Clearly, the 
interchannel correlation is not materially affected by a transformation of 
this type. 


2 The authors would like to acknowledge the aid of Or. R. S. Chhikara in 
preparing the statistical analyses of the correlation properties of the 
scene normalization procedure, as presented in the appendix of this document. 
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TABLE 

2.- 

CORRELATION 

STRUCTURE OF 

THE LANDCAT 

DIGITAL DATA 


FOR 

18 550' RANDOM SAMPLES OF 

SPRING WHEAT 

SCENES 




(a) Raw data 




Channel 1 

Channel 2 

Channel 3 

Channel 4 

Channel 

l 

1.000 




Channel 

2 

.920 

1.000 



Channel 

3 

.520 

.352 

1.000 


Channel 

4 

.301 

.113 

.940 

1.000 




(b) 

Relative energy 

values 




Channel 

1 Channel 2 

Channel 3 

Channel 4 

Channel 

1 

1.000 




Channel 

2 

.906 

1.000 



Channel 

3 

.473 

.351 

1.000 


Channel 

4 

.286 

.134 

.937 

1.000 
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3. COLOR TRANSFORMATION 


In most classical theory, color is held to be determined by the absolute 
amounts of radiation in the three primary bands or channels of green, blue, 
and red. However, E. H. Land and his colleagues at Polaroid have amassed 
considerable evidence that human perception of repeatable color is better 
correlated with relative amounts of radiation rather than with absolute 
radiation (Land, 1974, 1977). Land's definition of relative varies but Is 
approximately equivalent to the percentage of incident radiation. However, 
the behavior of the Landsat data illustrated in figures 1 through 4 suggests 
that a better scientific color reference point is the normalized scene mean. 

The proposed standardization procedure provides numbers that are relative to a 
neutral mean in which all channel means are equal to five. Our hypothesis is 
that all colors can be defined or classified by their positions within a cube 
whose three primary color axes (blue, green and red) have been scaled from 
zero to ten in these standardized numbers, called relative enerqies. A 
logical corollary is that human perception of color is analogous to this 
relative energy standardization and classification. 

There is considerable empirical evidence for the validity of the above 
hypothesis. Indeed, it seems to explain the effectiveness of a whole class of 
devices used to convert digital data to color representations on either film 
or cathode ray tube (CRT) screens. One such device is the Production Film 
Converter (PFC). It consists of a CRT light source with three filters, blue, 
green, and red. The amount of illumination through each filter for each pixel 
is controlled by Landsat digital tapes. Illumination is expressed in gun- 
counts and these range from 0 to 255. The machine is calibrated so that an 
illumination of 255 gun-counts in each channel will just produce colorless 
transparency (white) when the film product is viewed on a standard light 
table. The Landsat digital data are converted to gun-counts in a manner which 
results in the scene mean of each channel being equal to 128 gun-counts. This 
is a balancing technique to ensure that the scene mean is halfway between 
maximum opacity (black, created by zero gun-counts in each channel) and 
maximum transparency (white, created by 255 gun-counts in each channel). 

A 

n 


Thus, it Is a linear system equivalent to our hypothesized color cube with 
K = 128, rather than K = 5, for calculation of gun-counts. Fiqure 5 shows a 
flow diagram for the steps by which the system reproduces the relative eneroy 
distribution of the original scene. Each step is postulated to be a linear 
transformation from MSS counts to gun-counts to film exposure to film 
transmission equivalent to original relative energy. 

The hue distribution within cubic systems of the PFC type is shown in figure 6, 
(a) and (b). Figure 6(a) shows the cube standing on Its black vertex, with 
the white vertex projecting toward the viewer. The other vertices of the cube 
are labeled with the primary hue which each represents, toqether with the 
relative energy quantities required to produce them. Thus, each point in the 
cube is the vector addition of the three relative energies and colors are 
combined by vector addition. Figure 6(b) is a side view of the cube which 
shows that the three additive primaries (blue, green, and red) are one-third 
of the way from black to white, whereas the three subtractive primaries 
(yellow, cyan, and magenta) are one-third of the way from white to black. 

This is logical since the additive primaries combine to form white, while the 
subtractive primaries combine to form black. The symmetry is completed by the 
fact that all complementary pairs are exactly opposite each other, e.g., 
yellow and blue, magenta and green, and cyan and red. 

The same color distribution described above was presented in 1952 by 
Hickethier, based on empirical evidence in the printing industry, which 
utilizes subtractive primaries (Hickethier, 1952). Of course, the PFC is 
equivalent to a combination of additive primaries (gun-counts) and subtractive 
primaries (film emulsion c(yes). 

Color is usually considered to consist of three components, hue, value, and 
chroma. Hue represents the dominant wavelength, value is relative brightness, 
and chroma is the saturation or purity, vis-a-vis gray. The relationship 
between these components arid the color cube are illustrated in fiqure 7. We 
have adopted the following conventions for mathematical designation of these 
properties. Hue is expressed in degrees (and letters) following the hue 
diagram shown in figure 8. 



GC p1xe1 


128 

m: 


MSS 


pixel 


GC = 128 
GCmax =255 


RE p1xel 


5 

R5s 


MSS 


pixel 


RE = 5 
RE MAX = 


10 


*Sensor counts expressed as multiples of scene mean. 


Figure 5.- Color normalization and reproduction using relative 

energy (RE) concept. 



LEGEND 



lepicted toward viewer resting on black vertex 


Hue 


Dominant Wave 


Chroma Grey— ~ Saturated Color 
Value Black-^White 


Figure 7.- Relationship of hue, value, and chroma 
notation to the color cube. 
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Figure 8.- Diagram of :olor cube hue notation. 


Chroma Is expressed In units of relative energy. (The maximum chroma possible 
Is 8.165 and occurs at the primary apices of the cube.) Value Is expressed on 
a scale of 0 to 10 along the black-white line, and each unit has a length of 
/T relative energy units. The following are calculations for each of these 
color components. 

Value = S/3 


Chroma 

Hue 



2 V X r X 2 


*‘hh-hh’hh 


where 


2 2 2 
A = Xj + + X3 

s = x x + x 2 + x 3 

Xj = relative energy for channel 1 

X 2 = relative energy for channel 2 

X3 = relative energy for channel 4 

For interpretive purposes, we have found that it is helpful to segment the 

color cube according to channel ratios or rankings, since these may represent 
major differences which are directly attributable to physical causes such as 
the presence or absence of chlorophyll. The relationship between channel 
rankings and hues is shown in figure 9. These rankings have led us to 
hypothesize a generalized color seguence for agricultural crops. This is 
presented in table 3. Variations from this sequence, together with value and 
chroma differences, may serve to identify additional phenol ogi cal chanaes and 
help to differentiate among crops. Once the species and growth stage of a 
crop have been determined, we think that residual color differences may reveal 
stress and damage, and that field texture may indicate stand deficiencies. 
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TABLE 3.- GENERALIZED COLOR SEQUENCE FOR AGRICULTURAL CROPS 
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The first large-scale test of the color sequence approach utilized spectral 
data and associated ground truth for 456 spring wheat fields distributed In 
1978 over 35 sites In North Dakota, South Dakota, Montana, and Minnesota. 
Approximately 90 percent of these fields were found to follow our postulated 
color sequence during the normal spring small grain growing season. The 
remaining fields were unidentifiable due to Insufficient spectral data as a 
result of clouds, etc. These results demonstrate that the proposed 
standardization and color transformation make possible the comparison of 
Landsat data acquired over widely varying points In space and time. 



4. SUMMARY AND CONCLUSIONS 


A method has been found for removing most of the scenewide acqulsltlon-to- 
acquisltlon variability In Landsat data. The resulting mean-corrected values, 
called relative energies, can be further transformed Into a color space which 
permits direct comparison of visual and computerized Interpretations of MSS 
digital data. Preliminary tests Indicate that the procedures discussed In 
this paper permit mul tl temporal normalization and classification of Landsat 
signatures over */?1d*j areas. Mul tl temporal color sequences derived by these 
techniques appear to correspond with ground- truth observations on crop 
development In the major spring wheat production area of the United States. 
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APPENDIX A 


CORRELATION STRUCTURE IN THE NORMALIZED DATA 


CORRELATION STRUCTURE IN THE NORMALIZED DATA 


Let 


x 1jk = 


x i.1kl 

x 1jk2 



be the measurement vector for k th pixel of j th field in Uh scene. Define 


5X 


i jk 


W = 5 


'i.. 



where _ is the scene mean. Due to a large number of pixels in a scene 


Hi 


KJ 

where is the actual mean of the population of pixels represented by the i th 
scene. (Dot on top of equality sign means approximation.) 




Let n i j be the number of pixels In jt/j field of scene 1. Denote 


1.1. = if)j £ x uk 
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For the transformed data, the estimated covariance matrix is given by 
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Where 


<■( i ) = __ 1 
3 it,m l n.< - T 
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EE 
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(X ijk* ' “it* (X ijkm 



which is the within- segment covariance matrix for scene i. Thus, 
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The correlation (p & m ) between channels z and m given by 

S, 


£,m 


*,m 


> z,z m,m 


does not depend upon the mean vector ^ . It follows that 



Accordingly, the correlation structure is not changed at all by the 
transformation of to Y^. 


